An empirical investigation of missing data handling in cloud node failure prediction
- Creator: Ma, Minghua , Liu, Yudong , Rajmohan, Saravanakumar , Lin, Qingwei , Tong, Yuang , Li, Haozhe , Zhao, Pu , Xu, Yong , Zhang, Hongyu , He, Shilin , Wang, Lu , Dang, Yingnong
- Resource Type: conference paper
- Date: 2022
Multi-task Hierarchical Classification for Disk Failure Prediction in Online Service Systems
- Creator: Liu, Yudong , Yang, Hailan , Zhang, Chenjian , Wang, Paul , Dang, Yingnong , Rajmohan, Saravan , Zhang, Dongmei , Zhao, Pu , Ma, Minghua , Wen, Chengwu , Zhang, Hongyu , Luo, Chuan , Lin, Qingwei , Yi, Chang , Wang, Jiaojian
- Resource Type: conference paper
- Date: 2022
NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365
- Creator: Wang, Lu , Zhao, Pu , Zhang, Hongyu , Rajmohan, Saravan , Zhang, Dongmei , Du, Chao , Luo, Chuan , Su, Mengna , Yang, Fangkai , Liu, Yudong , Lin, Qingwei , Wang, Min , Dang, Yingnong
- Resource Type: conference paper
- Date: 2022
AutoCCAG: An Automated Approach to Constrained Covering Array Generation
- Creator: Luo, Chuan , Lin, Jinkun , Rajmohan, Saravanakumar , Zhang, Dongmei , Cai, Shaowei , Chen, Xin , He, Bing , Qiao, Bo , Zhao, Pu , Lin, Qingwei , Zhang, Hongyu , Wu, Wei
- Resource Type: conference paper
- Date: 2021
Correlation-Aware Heuristic Search for Intelligent Virtual Machine Provisioning in Cloud Systems
- Creator: Luo, Chuan , Qiao, Bo , Xing, Wenqian , Chen, Xin , Zhao, Pu , Du, Chao , Yao, Randolph , Zhang, Hongyu , Wu, Wei , Shaowei, Cai , Bing, He , Saravanakumar, Rajmohan , Qingwei, Lin
- Resource Type: conference paper
- Date: 2021
Effective low capacity status prediction for cloud systems
- Creator: Dong, Hang , Qin, Si , Abuduweili, Abulikemu , Ramanujan, Sanjay , Subramanian, Karthikeyan , Zhou, Andrew , Rajmohan, Saravanakumar , Zhang, Dongmei , Moscibroda, Thomas , Xu, Yong , Qiao, Bo , Zhou, Shandan , Yang, Xian , Luo, Chuan , Zhao, Pu , Lin, Qingwei , Zhang, Hongyu
- Resource Type: conference paper
- Date: 2021
Fast Outage Analysis of Large-Scale Production Clouds with Service Correlation Mining
- Creator: Wang, Yaohui , Li, Guozheng , Xu, Zhangwei , Zhao, Pu , Qiao, Bo , Li, Liqun , Zhang, Xu , Lin, Qingwei , Wang, Zijian , Kang, Yu , Zhou, Yangfan , Zhang, Hongyu , Gao, Feng , Sun, Jeffrey , Yang, Li , Lee, Pochian
- Resource Type: conference paper
- Date: 2021
Fighting the Fog of War: Automated Incident Detection for Cloud Systems
- Creator: Li, Liqun , Zhang, Xu , Gao, Feng , Yang, Li , Lin, Qingwei , Rajmohan, Saravanakumar , Xu, Zhangwei , Zhang, Dongmei , Zhao, Xin , Zhang, Hongyu , Kang, Yu , Zhao, Pu , Qiao, Bo , He, Shilin , Lee, Pochian , Sun, Jeffrey
- Resource Type: conference paper
- Date: 2021
How long will it take to mitigate this incident for online service systems?
- Creator: Wang, Weijing , Chen, Junjie , Xu, Zhangwei , Dang, Yingnong , Zhang, Dongmei , Yang, Lin , Zhang, Hongyu , Zhao, Pu , Qiao, Bo , Kang, Yu , Lin, Qingwei , Rajmohan, Saravanakumar , Gao, Feng
- Resource Type: conference paper
- Date: 2021
NTAM: Neighborhood-temporal attention model for disk failure prediction in cloud platforms
- Creator: Luo, Chuan , Zhao, Pu , Zhang, Dongmei , Qiao, Bo , Wu, Youjiang , Zhang, Hongyu , Wu, Wei , Lu, Weihai , Dang, Yingnong , Rajmohan, Saravanakumar , Lin, Qingwei
- Resource Type: conference paper
- Date: 2021
PULNS: Positive-Unlabeled Learning with Effective Negative Sample Selector
- Creator: Luo, Chuan , Zhao, Pu , Lin, Qingwei , Chen, Chen , Qiao, Bo , Du, Chao , Zhang, Hongyu , Wu, Wei , Cai, Shaowei , He, Bing , Rajmohan, Saravanakumar
- Resource Type: conference paper
- Date: 2021
How to mitigate the incident? An effective troubleshooting guide recommendation technique for online service systems
- Creator: Jiang, Jiajun , Lu, Weihai , Chen, Junjie , Lin, Qingwei , Zhao, Pu , Kang, Yu , Zhang, Hongyu , Xiong, Yingfei , Gao, Feng , Xu, Zhangwei , Dang, Yingnong , Zhang, Dongmei
- Resource Type: conference paper
- Date: 2020
Identifying linked incidents in large-scale online service systems
- Creator: Chen, Yujun , Yang, Xian , Dong, Hang , He, Xiaoting , Zhang, Hongyu , Lin, Qingwei , Chen, Junjie , Zhao, Pu , Kang, Yu , Gao, Feng , Xu, Zhangwei , Zhang, Dongmei
- Resource Type: conference paper
- Date: 2020
Intelligent virtual machine provisioning in cloud computing
- Creator: Luo, Chuan , Qiao, Bo , Chen, Xin , Zhao, Pu , Yao, Randolph , Zhang, Hongyu , Wu, Wei , Zhou, Andrew , Lin, Qingwei
- Resource Type: conference paper
- Date: 2020
Towards Intelligent Incident Management: Why We Need It and How We Make It
- Creator: Chen, Zhuangbin , Kang, Yu , Li, Liqun , Zhang, Xu , Zhang, Hongyu , Xu, Hui , Zhou, Yangfan , Yang, Li , Sun, Jeffrey , Xu, Zhangwei , Dang, Yingnong , Gao, Feng , Zhao, Pu , Qiao, Bo , Lin, Qingwei , Zhang, Dongmei , Lyu, Michael R.
- Resource Type: conference paper
- Date: 2020